S E M I N A R

 

Hypergraph-Partitioning Based Document Identifier Reassignment for Inverted File Compression

 

Izzet Çagri Baykan
MSc.Student
Computer Engineering Department
Bilkent University

The Inverted File is the most popular indexing mechanism for Information Retrieval Systems. Compressing an inverted file not only reduces space occupancy but also improves the overall retrieval performance since the disk access time decreases. The d-gap technique is used in compressing inverted files by replacing document identifiers with smaller gap values. However, fluctuating gap values cannot be efficiently compressed by well-known prefix-free codes. We propose a method for reassignment of document identifiers based on hypergraph-partitioning to smoothen and reduce the d-gap values, which will result in more efficient compression of the inverted files.

 

DATE: 5 November, 2007, Monday@ 16:50
PLACE: EA 409